3574 results found.
Language Type:
Multilingual
Languages:
English German
Availability:
Freely Available
License:
<Not Specified>
Size:
246688 tokens Production Status:
Newly created-finished
Use:
Sentiment Analysis
-
Paper title:PACE Corpus: a multilingual corpus of Polarity-annotated textual data from the domains Automotive and CEllphone
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Christian Haenig | ExB Group | DE |
| Author 2 | Andreas Niekler | University of Leipzig | DE |
| Author 3 | Carsten Wuensch | University of Bamberg | DE |
| Main Contact | Christian Haenig | ExB Group | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
100000 entries Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:Stress Test Evaluation for Natural Language Inference
-
Paper track:NLP engineering experiment paper
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Aakanksha Naik | Carnegie Mellon University | US |
| Author 2 | Abhilasha Ravichander | Carnegie Mellon University | US |
| Author 3 | Norman Sadeh | Carnegie Mellon University | N/A |
| Author 4 | Carolyn Rose | Carnegie Mellon University | US |
| Author 5 | Graham Neubig | Carnegie Mellon University | US |
| Main Contact | Aakanksha Naik | Carnegie Mellon University | None |
Documentation:
<Not Specified>
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
English German
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-ShareAlike 4.0
Size:
33400 tokens Production Status:
Newly created-finished
Use:
Speech Recognition/Understanding
-
Paper title:Crowdsourcing a Multi-lingual Speech Corpus: Recording, Transcription and Annotation of the CrowdIS Corpora
-
Paper track:Speech
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Andrew Caines | University of Cambridge | GB |
| Author 2 | Christian Bentz | University of Tübingen | DE |
| Author 3 | Calbert Graham | University of Cambridge | GB |
| Author 4 | Tim Polzehl | Quality and Usability Lab, Telekom Innovation Laboratories, TU-Berlin | DE |
| Author 5 | Paula Buttery | University of Cambridge | GB |
| Main Contact | Andrew Caines | University of Cambridge | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
Open Database License (ODbL)
Size:
550 million words Production Status:
Newly created-in progress
Use:
Text Mining
-
Paper title:RtGender: A Corpus for Studying Differential Responses to Gender
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Rob Voigt | Stanford University | US |
| Author 2 | David Jurgens | University of Michigan | US |
| Author 3 | Vinodkumar Prabhakaran | Stanford University | US |
| Author 4 | Dan Jurafsky | Stanford University | US |
| Author 5 | Yulia Tsvetkov | Carnegie Mellon University | US |
| Main Contact | Rob Voigt | Stanford University | None |
Documentation:
<Not Specified>
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
Open Data Commons Attribution License v1.0
Size:
10:47:35 hours Production Status:
Newly created-finished
Use:
Dialogue
-
Paper title:KTH Tangrams: A Dataset for Research on Alignment and Conceptual Pacts in Task-Oriented Dialogue
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Todd Shore | KTH Speech, Music and Hearing | SE |
| Author 2 | Theofronia Androulakaki | KTH Speech, Music and Hearing | SE |
| Author 3 | Gabriel Skantze | KTH Speech, Music and Hearing | SE |
| Main Contact | Todd Shore | KTH Speech, Music and Hearing | None |
Documentation:
An overview of the data structure and annotation scheme is included in PDF format.
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
English German
Availability:
From Data Center(s)
License:
Creative Commons BY
Size:
220500 tokens Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:SubCo: A Learner Translation Corpus of Human and Machine Subtitles
-
Paper track:Multimodality
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | José Manuel Martínez Martínez | Universität des Saarlandes | DE |
| Author 2 | Mihaela Vela | Saarland University | None |
| Main Contact | José Manuel Martínez Martínez | Universität des Saarlandes | None |
Documentation:
Yes, English, yesLanguage Type:
Multilingual
Languages:
English Malayalam
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial 4.0 International License
Size:
384613 parallel words Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Lexical Resources to Enrich English Malayalam Machine Translation
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Sreelekha S | IIT Bombay | IN |
| Author 2 | Pushpak Bhattacharyya | CSE Department, IIT Bombay | IN |
| Main Contact | Sreelekha S | IIT Bombay | None |
Documentation:
Yes. In English. Yes
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
Dutch English
Availability:
Freely Available
License:
still to be determined
Size:
34 annotated spoken dialogues OtherProduction Status:
Newly created-in progress
Use:
Dialogue
-
Paper title:The DialogBank
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Harry Bunt | Tilburg University | NL |
| Author 2 | Volha Petukhova | Saarland University | DE |
| Author 3 | Andrei Malchanau | Saarland University | DE |
| Author 4 | Kars Wijnhoven | Tilburg University | NL |
| Author 5 | Alex Fang | City University of Hong Kong | CN |
| Main Contact | Harry Bunt | Tilburg University | None |
Documentation:
At the website https://dialogbank.uvt.nl; ISO standard 24617-2; conference papers
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
Speech and gesture data for 67 minutes of interaction OtherProduction Status:
Newly created-in progress
Use:
Dialogue
-
Paper title:A Multimodal Motion-Captured Corpus of Matched and Mismatched Extravert-Introvert Conversational Pairs
-
Paper track:Multimodality
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Jackson Tolins | University of California, Santa Cruz | US |
| Author 2 | Kris Liu | University of California, Santa Cruz | US |
| Author 3 | Yingying Wang | University of California, Davis | US |
| Author 4 | Jean E. Fox Tree | University of California, Santa Cruz | US |
| Author 5 | Marilyn Walker | University of California Santa Cruz | US |
| Author 6 | Michael Neff | University of California, Davis | US |
| Main Contact | Jackson Tolins | University of California, Santa Cruz | None |
Documentation:
<Not Specified>
Written
Annotation Tool,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution-No Derivative Licence
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Corpus Creation/Annotation
-
Paper title:SACR: A Drag-and-Drop Based Tool for Coreference Annotation
-
Paper track:Written
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Bruno Oberle | University of Strasbourg | FR |
| Main Contact | Bruno Oberle | University of Strasbourg | None |
Documentation:
<Not Specified>




